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Abstract 

We consider the problem of two transmitters wishing to exchange information through a relay in the middle. 
The channels between the transmitters and the relay are assumed to be synchronized, average power constrained 
additive white Gaussian noise channels with a real input with signal-to-noise ratio (SNR) of snr. An upper bound 
on the capacity is ilog(l + snr) bits per transmitter per use of the medium-access phase and broadcast phase of 
the bi-directional relay channel. We show that using lattice codes and lattice decoding, we can obtain a rate of 
^ log(^ + snr) bits per transmitter, which is essentially optimal at high SNR. The main idea is to decode the sum 
of the codewords modulo a lattice at the relay followed by a broadcast phase which performs Slepian-Wolf coding 
with structured codes. For asymptotically low SNR, joint decoding of the two transmissions at the relay (MAC 
channel) is shown to be optimal. We also show that if the two transmitters use identical lattices with minimum 
angle decoding, we can achieve the same rate of ^ log(^ + snr). The proposed scheme can be thought of as a joint 
physical layer, network layer code which outperforms other recently proposed analog network coding schemes. 

I. Introduction, System Model and Problem statement 

We consider the bi-directional relaying problem where two users try to exchange information with each other 
through a relay in the middle. More specifically, we study a simple 3-node linear Gaussian network as shown in 
Fig. [T] Nodes A and B wish to exchange information between each other through the relay node R, however, nodes 
A and B cannot communicate with each other directly. Let ua G {0, l}*^ and G {0,1}'^, be the iirformation 
vectors at nodes A and B (vectors are denoted by bold face letters such as u throughout the paper). The information 
is assumed to be encoded into vectors (codewords) xi G R" and X2 G M" at nodes A and B, respectively, and 
transmitted. We assume that communication takes place in two phases - a multiple access (MAC) phase and a 
broadcast phase, which are briefly described below. 

a) MAC phase: During the MAC phase, nodes A and B transmit xi and X2 in n uses of an AWGN channel 
to the relay. It is assumed that the two transmissions are perfectly synchronized and, hence, the received signal at 
the relay G M" is given by 

yij = xi + X2 + z 

where the components of z are independent identically distributed (i.i.d) Gaussian random variables with zero mean 
and variance a'^. Further, it is assumed that there is an average transmit power constraint of P at both nodes and, 
hence, £^[||xi||2] < nP and ^[||x2|p] < nP. 

b) Broadcast phase: During the broadcast phase, the relay node transmits x/j G M" in n uses of an AWGN 
broadcast channel to both nodes A and B. It is assumed that the average transmit power at the relay node is also 
constrained to P and that the noise variance at the two nodes is also a^. Node A forms an estimate of u^, namely 

and node B forms an estimate of u^, namely u^. An error is said to occur if either 7^ or ua ua, 
i.e., the probability of error is given by 

Pe = Pr ({u^ / u^} [j{uB / ub}) 

It is assumed that the communications in the MAC and broadcast phases are orthogonal. For example, the 
communications in the MAC and broadcast phase could be in two separate frequency bands (or in two different 
time slots) and, hence, the MAC phase and broadcast phase do not interfere with each other. To keep the discussion 
simple, we will assume that the MAC and broadcast phases occur in different time slots. This can be easily 
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Fig. 1. System Model with 3 Nodes 

generalized to the case when 2n dimensions are available for communication, out of which n dimensions are 
allocated to the MAC phase and n dimensions are allocated to the broadcast phase. Note that the signal-to-noise 
ratio (SNR) for all transmissions is snr = P/a^ and, hence, we refer to it as simply the SNR without having to 
distinguish between the SNRs of the different phases. Similarly, we restrict our attention to the case when the both 
nodes A and B wish to exchange identical amount of information and, hence, we can simply refer to one exchange 
rate without having to distinguish between the rate for A and B separately. 

Formally, we define the exchange rate Rex, scheme for an encoding/decoding scheme as the maximum information 
rate that can be exchanged reliably, i.e., 

Rex,scheme — max — : > as n > oo. 
n 

The exchange capacity Cex is then the supremum of Rex,scheme over all possible encoding/decoding schemes. 



II. Main Results and comments 

We mainly consider the case when the MAC and broadcast phase are both restricted to using exactly n uses of 
the channel each. For this case, the main results in this paper are 

• The exchange capacity satisfies 

1 / P 

Cex < -log( 1 + ^ 

because the MAC and broadcast phase each consist of n AWGN channel uses. 

• An exchange rate of 

_ 1 /I P 

Rex,LatUce — ^ i 2 ~'~ CJ^ 



is achievable using the lattice coding scheme with dither and lattice decoding discussed in Section VI The 



same exchange rate can also be obtained using the lattice coding scheme (without dither) and minimum angle 



decoding discussed in Section VII At high SNR, these lattice based coding and decoding schemes are nearly 



optimal because their rates approach the upper bound. 
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An exchange rate of 



ex,JD 



log 



. 2P\ 



is achievable using the joint decoding scheme in Section VIII At low SNR, these scheme is nearly optimal 
because the rate approaches the upper bound. 

Clearly, any rate of the form (3Rex,jD + (1 — [i) Rex, Lattice is achievable for any < /3 < 1 by time sharing 
between the two schemes. This outperforms the recently proposed analog network coding idea in [3] over the 
entire range of SNR. 



III. Related Prior Work 

Recently, there has been a significant amount of work on coding for the bi-directional relay problem [l]-[7]. 
In [2], Katti et al., showed the usefulness of network coding for this problem. Although they do not consider the 
physical layer explicitly in this work, the natural extension of their solution to our problem would work as follows. 
The 2n channel uses available for signaling would be split into three slots with 2n/3 channel uses each. In the 
first time slot, vla is encoded using an optimal channel code for the AWGN channel into xi and transmitted from 
node A. Similarly, in the second time slot is encoded into X2 and transmitted from node B. At the relay, 
and are decoded and then the relay forms u^j = ® and encodes into x/j using an optimal code 
for the Gaussian channel and broadcasts into both nodes. The two nodes decode and then since they have 
and VLB, they can obtain and vla at the nodes A and B, respectively. Here, the physical layer and network 
layer are completely separated and coding (or mixing of the information) is performed only at the network layer. 
In the system model considered in Fig. [T| the physical layer naturally performs mixing of the signals from the two 
transmitters. The schemes that take advantage of this can be referred to as joint physical layer coding and network 
coding solutions. 

One such scheme called analog network coding was recently proposed in [3]. In this case, the MAC phase and 
broadcast phase use n uses of the AWGN and are orthogonal to each other. Gaussian code books are used at the 
transmitters to encode u^^ into xi and into X2, respectively. Analog network coding is an amplify (rather than 
scale) and forward scheme where the received signal at the relay during the MAC phase y/j, is scaled to satisfy 

the power constraint and transmitted during the broadcast phase, i.e., x/j = ^J^^^Yr- It can be seen that this 

scheme can achieve an exchange capacity of ^ log + tft^^ > which is higher than that achievable with the pure 

network coding scheme in [2] for high SNR. 

The schemes proposed in this paper can be thought of as decode and forward schemes which outperform the 
amplify and forward scheme in [3]. In a very recent work [6], it is conjectui^ed that an exchange rate of ^ log (l + ^) 
can be achieved, however, no scheme is given or even conjectured. The scheme in this paper is a constructive scheme 



that performs close to the rate conjectured in [6]. The lattice decoding scheme discussed in Section VI is similar 
to that used by Nazer and Gastpar [20] for the problem of estimation the sum of two Gaussian random variables, 
but was independently proposed in the previous version of this paper [19]. 



IV. An Optimal Transmission Scheme for the BSC Channel 

To motivate our proposed scheme, we first consider a system where the physical layer channels are all binary 
symmetric channels, i.e., xi G {0, 1}", X2 G {0, 1}" and the signal received at the relay is 

y/? = xi©X2©e (1) 

where © denotes binary addition and e is an error sequence whose components are or 1 with probability q and 
1 — q respectively and are i.i.d. Similarly, during the broadcast phase also let the channel be a BSC channel with 
crossover probability q. In this case, an upper bound on the exchange capacity can be seen to be 1 — H{q) since 
this is the maximum information that can flow to any of the nodes from the relay. This can be achieved using the 
following coding scheme. 

In this scheme, the two nodes use identical capacity achieving binary linear codes of rate 1 — H{q). Consider 
again the received signal at the relay given in ([T]). Notice that since xi and X2 ai^e codewords of the same linear 
code, xi © X2 is also a valid codeword from the same code which achieves capacity over a BSC channel with 
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crossover probability q. Hence, the relay can decode xi © X2 and transmit the result during the broadcast phase. The 
nodes A and B can also decode xi © X2 and since they have xi and X2, they can obtain X2 and xi, respectively. 
This scheme achieves an exchange rate of 1 — H{q) and is therefore optimal. 

Random codes versus structured codes: It is quite interesting to note that if random codes, i.e., codes from 
the Shannon ensemble were used instead of linear codes, xi © X2 cannot be decoded at the relay. The linearity (or 
group structure) of the code is exploited to make xi © X2 decodable at the relay and, hence, structured codes with a 
group structure outperform random codes for this problem. Examples of schemes were structured codes outperform 
random codes have been given in Korner and Marton [8] and more recently by Nazer and Gastpar in [9], [10]. 



V. Upper Bound on the Exchange Rate for Gaussian links 

Let us now return to the problem outlined in Section |lj where the channels between the nodes and the relay 
are AWGN channels. We restrict our attention to schemes where the MAC phase and broadcast phase are both 
orthogonal to each other and use n channel uses (or dimensions). With this restriction, a simple upper bound on 
the exchange rate can be obtained as follows. Consider a cut between the relay node and node A. The maximum 
amount of information that can flow to either of the nodes from the relay is \ log (l + §2)- Hence, the exchange 
capacity is upper bounded by 

Cex,ub = ^1 + • 

We now consider coding schemes and analyze their performance. 



VI. Nested Lattice Based Coding Scheme with Lattice Decoding 



As shown in Section IV for the BSC channel, codes with a group structure (linear codes) enable decoding of a 
linear combination (or, sum) codewords at the relay. This motivates the use of lattice codes for the Gaussian channel 
since lattices have a similar group structure with respect to real vector addition. We begin with some preliminaries 
about lattices [12], [14]. 

An n-dimensional lattice A is a subgroup of M" under vector addition over the reals. This implies that if 
Ai, A2 € A, then A1 + A2 € A. For any x G M", the quantization of x, (5a(x) is defined as the A G A that is closest to 
X with respect to Euclidean distance. The fundamental Voronoi region V(A) is defined as V(A) = {x : Qa(x) = 0}. 
The mod operation is defined as (x mod A) = x — (5a(x). This can be interpreted as the error in quantizing an 
X to the closest point in the lattice A. The second moment of a lattice is given by 

a\K) = -}—l. [ llxipdx, 

and where ^(A), the volume of the fundamental Voronoi region is denoted by ^(A) = /y^^^^ dx. The normaUzed 
second moment of the lattice is then given by 

G(A) = (j2(A)/y(A)i/". 

Let us define the covering radius of a lattice as the radius of the smallest n-dimensional hyper sphere 
containing the Voronoi region V(A). Also let TZi denote the effective radius of V(A), which is the radius of the 
n-dimensional hyper sphere having the same volume as V(A). Now we can define a Rogers-good Lattice [12, (69)] 
as 



1<(^) <c•n•(logn)^ (2) 



where c and a are constants. This implies. 



-log|^ = of-). (3) 



In this work, we are interested in nested lattices. Formally, we can say the lattice Ac (the coarse lattice) is nested 
in the lattice Aj (the fine lattice) if Ac C Aj [13]. Let the fundamental Voronoi regions of the lattices, Ac and 
A/ be V(Ac) and V(A/). The existence of nested lattices for which G(Ac) ^ l/(27re) and G{Af) l/(27re) has 
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Fig. 2. Lattice code based scheme sliowing tlie transmitted signals ti, t2 and tiie decoded signal at the relay t 



been shown in [13], [14]. The number of lattice points of the fine lattice in the basic Voronoi region of the coarse 
lattice is given by the nesting ratio yj^^ ■ 

Lattice codes can be used to achieve capacity on the single user AWGN channel under maximum likelihood 
(ML) [15], [17], [11]. More recently, nested lattices have been shown to achieve the capacity of the single user 
AWGN channel under lattice decoding [12], [14]. The main idea in [12], [14] is to use the coarse lattice as a 
shaping region and the lattice points from the fine lattice contained within the basic Voronoi region of the coarse 
lattice as the codewords. The existence of good nested lattices that achieve capacity has been shown in [12]. 



A. Description 

We now describe our encoding and decoding schemes for the bi-directional relaying problem using nested 
lattices. The encoding and decoding operations during the MAC and broadcast phase are explained below. A 
general schematic is also shown in Fig. |2] 

MAC Phase: Let there be k information bits in the information vector and and, hence, the exchange rate 
is i? = k/n. At node A, the information vector ua is mapped onto a fine lattice point ti G {Aj n V(Ac)}, i.e., the 
set of all fine lattice points in the basic Voronoi region of the coarse lattice is taken to be the code. An identical 
code is used at node B and the information vector is mapped onto the codeword t2 € {Aj n V(Ac)}. We then 
generate dither vectors di and d2 which are randomly generated n dimensional vectors uniformly distributed over 
V(Ac). The dither vectors are mutually independent of each other and are known at both the relay node and the 
nodes A and B. Now node A and node B form the transmitted signal xi and X2 as follows 

xi = (ti - di) mod Ac (4) 

X2 = (t2 - d2) mod Ac (5) 

By choosing an appropriate coarse lattice with second moment P, the transmit power constraint will be satisfied 
at both nodes. Assuming perfect synchronization, the relay node receives y/j given by 

yij = XI + X2 + z (6) 

where z is the noise vector whose components have variance a^. 
The main idea is that the relay decodes 

t = (ti+t2) mod Ac 

from the received signal y/j. 
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Broadcast Phase: In the broadcast phase, the relay node transmits the index of t using a capacity achieving code 
for the AWGN channel. The index (or, equivalently, t) can be obtained at the nodes A and B, and using a mod A 
operation, t2 and ti can can be recovered at nodes A and B, respectively. This scheme can also be thought of as a 
decode and forward scheme where the relay decodes a function of ti and t2, namely t = (ti + 12) mod Ac and 
forwards this to the nodes. Notice however, that the relay will not know either ti or t2 exactly, it only knows t. 



B. Achievable rate 

Before we discuss the achievable exchange rate with the lattice encoding scheme and decoding scheme discussed 
above, we need a few definitions and lemmas. 

The coding rate of the nested lattice code is defined as the logarithm of the number of lattice points of the fine 
lattice in A/ n V(Ac) which is given by 

ilog|A,nV(Ad| = ilog^ 

Lemma 1: Let ti and t2 be independent random variables which are uniformly distributed over the set of all fine 
lattice points in V(Ac), i.e., Pr[ti = A] = 1/2"^, VA G {A/nV(Ac)} and Pr[t2 = A] = 1/2''^, VA G {A/nV(Ac)}, 
where R is the coding rate of the nested lattice code. Then, t = {ti + t2) mod A is also uniformly distributed 
over {A/ n V{K)}. 



Proof: 



Pr[* = A] = J]Pr[ti = A,]Pr[i = A|ti = Ai] (7) 

i=l 

= 2ni? ^[(^^ + *2) mod Ae = A|ti = \i] (8) 

i=l 

= P'"[*2 = (A - \i) mod K\ti = A,] (9) 

i=l 

11 1 

~ ^ 2nR 2nR ~ ^^^^ 
1=1 



We restate some of definitions in [12]. Let TZu be the covering radius of Ac and let BilZy) be the n-dimensional 
ball of radius TZu- Let be the second moment (per dimension) of the smallest ball containing V(Ac), i.e., 

= Ml / M?du. (11) 



n\\B{nu)\\ Jb(tz^) 

Note that V(Ac) has a second moment P and hence P < p^. Let Z* be a random variable given by Z* = 
(1 - a)(Zi + Z2) + «Z, with Zi ~ N{<d, p'^V), Z2 ~ M{Q, p^V) and Z ~ A/'(0, a^P), where P is the n x n 
identity matrix. The variance of Z* satisfies 

Lemma 2 (Modified version of [12, Lemma 6]): 

" < Var(Z*) = (l-a)2(p2 + p2) + ^2^2 < /'M'J^^ (12) 



n + 2'2P + a^- ^ ^ ^ J ' y J ' -\RiJ 2P + a'^' 

where TZu is the covering radius of the coarse lattice Ac and TZi is the radius of the n-dimensional hyper sphere 
whose volume is equal to the volume of the basic Voronoi region V(Ac). 
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Proof: The proof of the above lemma closely follows the proof in [12]. Equations [12, 116-118] can be used 
to get a bound on Var(Zi) and Var(Z2), i.e., p^. Choosing a = 2P+a^ ' modified version of [12, Lemma 6] can 
be proved. ■ 
Let Xi and X2 be two independent random variables which are uniformly distributed over V(Ac) and let 
Z ~ AA(0, cr^I) be an n-dimensional Gaussian vector independent of Xi and X2. Further, let Zgg = (1 — a)(Xi + 
X2) + aZ, where a = 2P+a^ ■ Notice that Zgg is not Gaussian. We next state a Lemma which is a modified version 
of [12, Lemma 11] by Erez and Zamir which essentially shows that there exist good lattices for which Zgg can be 
well approximated by a Gaussian of nearly the same variance and the approximation gets better as n — > 00. 

Lemma 3 (Modified version of [12, Lemma 11]): Let Ac be a lattice which is both Rogers-good as well as 
Poltyrev-good. Then, for any x, 

/^^,(x) <e^^(^>/z.(x), (13) 

where ei(Ac) ^ as n ^ 00. 

Proof: To prove the modified version of [12, Lemma 11], we can repeat the steps in [12], and equation [12, 
200] can be restated with the notation in this paper as 

-log|^ = ei(Ae) = of-y (14) 
n /zi(x) \nj 

Similarly for Z2, we get 

'logf^ = .i(A.) = o(i). (15) 



Combining ( 14) and ( 15 1 and also the definition of Z* as Z* = (1 — a)(Zi + Z2) + aN, we can get the proof of 



the modified version of [12, Lemma 11]. ■ 

We next state the theorem which is an application of the above Lemmas. This is very similar to [12, Theorem 
5]. 

Theorem 4 (modified version of [12, Theorem 5]): Let xi and X2 be realizations of two independent random 
variables which are uniformly distributed over V(Ac) and let z be a realization of an n-dimensional Gaussian 
vector Z ~ AA(0,cj^I). Further, let z^q = (1 — a)(xi + X2) + qz, where a = 2P+a^ ■ coding rate 

R < ^ log(5 + i^)' there exists a sequence of n-dimensional nested lattice pairs (aJ."^ A^"^) of coding rate R for 
which 

Pr{Zeg Vj"^} ^ 0, as, n ^ 00. 

Proof: The proof follows closely the proof of [12, Theorem 5]. We mention the places where the proof in [12] 
that have to be modified. Equation (81) in [12] must be modified to take into account that we have two transmitters. 
Equation [12, (81)] must be modified with Lemma 2. Also Equation [12, (82)] must be modified with Lemma 3. 
After this, we can continue with the proof in [12] by calculating the Poltyrev exponents and also using the fact 
that Rogers-good and Poltyrev-good lattices exist. Continuing with these steps in [12] shows that we can obtain 
the rate of i? < ^ log (2 + j^). which proves the theorem. ■ 

We are ready to present the main theorem in this Section which is given below. 

Theorem 5: For the lattice coding scheme described above, any exchange rate Rex,iattice < ^^og{^ + ^) is 
achievable with lattice (Euclidean) decoding. 

Proof: 

MAC Phase: We will first show that the probability of error in decoding t from yji can be made arbitrarily small 
for asymptotically large n. During the MAC phase, the relay tries to decode to 

t = (ti + t2) mod A 
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from the received signal yji (given in ([6j) as follows. The decoder at the relay node forms t = (ay/j + di + 
mod Ac (a will be determined later) and finds the lattice point in the fine lattice that is closest to t, i.e., the estimate 
of t is QA/(t)- Using the distributive property of the mod operation, t can be written as: 

t = (ay + di + d2) mod Ac 

= (a(xi + X2) + az + di + d2) mod Ac 

= (xi + X2 + di + d2 + az — (1 — a)(xi + X2)) mod Ac 

= ((ti-di) mod Ac + (t2 - d2) mod Ac + 
di + d2 + az — (1 — a)(xi + X2)) mod Ac 

= ((ti+t2) mod Ac + az — (1 — a)(xi + X2)) mod Ac 

= (t + az - (1 - a)(xi + X2)) mod Ac (16) 

Due to the group structure of the lattice, t is a lattice point in the fine lattice (more precisely, t S {AjR V(Ac)}). 
From Lemma 1, it can be seen that t is uniformly distributed over V(Ac). Further, note that ti and t2 are independent 
of z, xi and X2 and, hence, we can define an equivalent noise term as Zgg = az — (1 — a)(xi +X2) such that t and 
Zeq are independent of each other. The second moment of Zeq is given by C7c\ = E[Z^,g] = a^a^ + (1 - af2P. 
We can now choose a to minimize E[z'^q] and the resulting optimum values of a and fjgg are aopt = 2P+a^ ^^'^ 
(^eq,opt — 2P+a^ ■ Ffom Thcorcm 4| it can be seen that there exist nested lattices of rate RiatUce < \ log ( 5 + 
for which Pr{Leq vl"^} ^ 0, as, n —> 00. and, hence, the probability of decoding error 



p 



Hence, we can use a rate of 



Pe = Pr{QA^(t) / t} ^ as n ^ 00. 



Rhattice = \ log2 |A/ H V(Ac)| < ^ log Q + ^ (17) 

at each of the nodes and (ti + t2) mod Ac can be decoded at the relay. 
Broadcast Phase: 

In the broadcast phase, the relay node transmits the index of t using a capacity achieving code for the AWGN 
channel. Since the capacity of the AWGN is \ log (l + ^) which is higher than RiatUce in ( [l7| and, hence, the 
index (or, equivalently, t) can be obtained at the nodes A and B. Since node A already has and, hence, ti, it 
needs to recover ub or, equivalently, t2, from t and ti. This can be done as follows. Node A computes (t — ti) 
mod Ac, which can be written as 



(t — ti) mod Ac = ((ti+t2) modAc — ti) mod Ac 
= t2 mod Ac 

= t2 (18) 

Similarly, ti can also be obtained in node B by taking (t — 12) mod Ac. Hence, an effective rate of Rex,Lattice < 
^ log (5 + ^) can be obtained using nested lattices with lattice decoding. ■ 

We conclude by noting that, at high SNR, this scheme approaches the upper bound of ^ log {l + and is 
therefore nearly optimal. 

This scheme can be interpreted as a Slepian-Wolf coding scheme using nested lattices, i.e., the relay wishes to 
convey ti + t2 to node A, where some side information (namely, ti) is available. Thus, the broadcast phase in 
effect uses nested lattices for solving the Slepian-Wolf coding problem. This scheme can also be thought of as a 
decode and forward scheme where the relay decodes a function of ti and t2, namely t = (ti + 12) mod Ac and 
forwards this to the nodes. Notice however, that the relay will not know either ti or t2 exactly, it only knows t. 

Since the nested lattice code we have used is a capacity achieving code for the AWGN channel, one does not 
have to encode t again. The relay can simply transmit t to the nodes A and B and t can be decoded at the nodes 
A and B in the presence of noise at the nodes A and B. Notice that ii^[||t|p] < P and, hence, the power constraint 
will be satisfied at the relay node. 
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VII. Lattice Coding with Minimum Angle Decoding 

In the previous section we observed that nested lattice decoding can achieve an exchange rate of ^ log(^ + ^) 
with lattice decoding alone. This naturally leads us to the question, can we achieve a better performance by using 
other decoding schemes? In this section we study a suboptimal decoder called the minimum angle decoder [11]. 

A. Description 

We next briefly explain our minimum angle decoding scheme. We have two transmitters communicating si- 
multaneously to a central router. Both of them have the same power constraint P. The noise in the channel is 
Gaussian having a variance cr^. As we have seen in previous sections, choosing a good lattice code provides us 
with a considerable rate gain compared to random codes. Here we consider a n-dimensional lattice A„ C H". Let 
T ^ be an n-dimensional closed ball, centered at the origin and having a radius \JnP, and let 14(\/nP) be the 
hyper- volume of T This can be treated as a power constraint. Our codewords will be composed of lattice points 
in the sphere T ^p. Our encoding strategy would be that, each transmitter chooses a lattice point corresponding 
to its message index and transmits synchronously over the the Gaussian channel. Here we have no nested lattice 
construction or the use of dither in the encoding stage. At the receiver we will be interested in decoding to the 
sum of these lattice points. 

Minimum Angle Decoder: A minimum angle decoder discussed here makes a decision based on lattice points 
in a thin n-dimensional spherical shell = {x G R" : ^ni^P — S) < ||x|| < -^/n(2P~+~5)}, S is small and 

non-zero. It takes the received vector ana finds the lattice point, whose projection on the thin shell, is closest to 
the received vector. 

An optimal decoder can be seen to be a maximum likelihood decoder. However, it is very tough to analyze this 
decoder. Hence, we analyze the minimum angle decoder, which is more tractable analytically. We next state the 
main theorem of this section, 

B. Achievable rate 

Theorem 6: For the bi-directional relaying problem considered in Section [T| there exists at least one n-dimensional 
lattice Kn such that an exchange rate of Rex < \ log(| + SNR) is achievable using a minimum angle decoder as 

n — > oo. 



Proof Sketch 




Fig. 3. Picture showing the concentration of the sum of lattice points on the thin shell of radius \/n2P. 
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It is well known that the volume of an n-dimensional sphere is concentrated mainly on the surface of the sphere 
as the dimension becomes large. It is also known that, if we intersect a lattice with a n-dimensional sphere, then 
most of the lattice points will be concentrated very close to the surface [11]. In the course of our proof, we will 
show that the sum of any two such randomly chosen lattice points is also concentrated on a thin spherical shell 
at a radius of ^j2nP. Hence, the probability of error will be largely depended on the lattice points in the 
thin spherical shell T^^. We will be using the Blichfeldt's principle to show that there exist translations (one for 
each user) of the lattice A„ where the sum points are concentrated (see Theorem [9j Lemma 12 and Lemma 13 1. 



Once concentration for the sum of lattice points can be established, we can perform minimum angle decoding. In 
minimum angle decoding, we are interested only in the angle between the different lattice points on the thin spherical 
shell. It must be noted that the choice of the lattice A„ must be such that it must act as a good channel code. The 



Minkowski-Hlawka theorem (Theorem 11 and Lemma 14 1 can be used to show existence of such lattices. Choosing 



volume of the lattice's Voronoi region appropriately allows us to compute an achievable rate of this scheme. 

C. Detailed proof 

First let us provide some definitions and notation, 

• An denotes an n-dimensional lattice and let P„ be the basic Voronoi region of the lattice. 

• si,S2 are two n-dimensional translational vectors, si,S2 E Pn- 

• £(s, T) is defined as the set of lattice points translated by a vector s present in the region T or otherwise, 

c{s,T) = (A„ + s)nr. 

• T and T represents n-dimensional hyper-spheres centered at the origin and having a radius of \/nP 
and Vn2P respectively. 

• represents n-dimensional thin spherical shell around the radius \fv2P and is defined as T^^^^p = ^ 
: y^n(2P - S) < ||x|| < y^n(2P + (^)}, 6 is small and non-zero. 

• Ci and C2 are codewords formed by intersection of lattice points of hyper-spheres and are given by, Ci = 

£(si, T^) = (A„ + si) n T/p and C2 = C{s2,T^) = (A„ + S2) n T^. 

• Similarly C ^j^p is given as C ^^^p = -^(si + 82,7" ^/2p) = (An + si + S2) n T and C^^p given by 

= £(si + S2, r^) = {K + si + S2) n T^. 

• C2 are codewords formed by intersection of lattice points of hyper-spheres and are given by, Ci = £(si , T ^) = 
(A„ + si) n and C2 = £(s2, T^) = (A„ + S2) n T^. 

• Let C0 be defined as = Ci x C2 = {(xi,X2) : xi G Ci, X2 G C2}. This denotes the combined collection of 
pairs of codewords of both the transmitters. 

• Again let = {(xi,X2) : xi + X2 G ^.^p'^i ^ Ci,X2 G C2} This denotes the codeword pairs whose sum 
lies on the thin shell T^^. 

• Let = C© \ . This denotes the code word pairs whose sum does not lie on the thin shell. It must be 
noted that the set formed by the sum of codewords in C© need not be the same as and at low SNR this 
may lead to significant difference between ML and minimum angle decoding. 

• Let Ml, M2, M®, and denote the cardinaUty of Ci, C2, C®, and respectively. 

• For a given code C, let us denote the average probability of error, under minimum distance decoding as P''. 

• Let us define a projection function vr. This projects a n dimensional vector onto to an inner sphere of radius 
^Jn(2P-5). It is defined as 7r(x) = ( v^n(2P - (5)/||x||)x. 

It is easy to see that minimum distance decoding is equivalent to maximum likelihood decoding in the presence 
of Gaussian noise. 

As mentioned before the set of lattice points whose sum lies in the thin spherical shell is much larger than 
the lattice points whose sum lies outside the spherical region. Hence the average probability of error will not be 
affected much by these lattice points. This motivates us to express the average probability of error P''® as a sum 
of two terms as made more clear in the lemma below[ll]. 

Lemma 7: ^ 

- Me Me ^ ^ ^ ' ^" 

Proof: 
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Let the ordered pair (xi,X2) £ C®, denote that xi G Ci and X2 G C2. We next follow similar steps of the 
proof as given in [11]. Let P'^®(xi,X2) = P^®(xi +X2), denote the probability of error in decoding to the sum 
(xi + X2), for a pair (xi,X2) G C®. xi and X2 are transmitted simultaneously by user 1 and 2 respectively and 
the router receives the sum xi + X2 corrupted with some Gaussian noise. Now let P*^® , represent a suboptimum 
decoder which maps the received point to the nearest sum Xi + X2, where (xi,X2) G C^. Hence we can bound 
the average probability of error as follows 



~ 1 

~ Mff 



(xi,X2)eC^ 



(xi,x2)eC| 



(a) 

Me Mg: 

M^ 1 

Me ^MgE 

(b) M' 1 

< H 

- Mffi Mff 



E ^nxi,x2) 

{xi,x2)eCg 

E P^*(X1 + X2) 

(xi,x2)ec^ 

E P"^^®)(X1 + X2) 

(xi,x2)ec^ 



Here in (a) the first term follows since M^ is the cardinahty of C® and the probabihty of error using our 
suboptimal decoder is 1, when (xi,X2) G Ce- In (b) tt refers to the projection function defined before and P'^('^e), 
represents the minimum angle decoder described before. The inequality can be shown to be true by referring to 
the discussions on [11, Lemma 2]. 

■ 

Next since we are interested in lattice points projected on to the iimer sphere of radius \/n{2P — S), we can 
define a decoding algorithm that looks at the angle between the lattice points, the minimum angle decoder. We 
next establish some more definitions. Let Pe)(y) denote a n-dimensional cone centered at the origin and having the 
axis passing through y. Let 9 be the half-angle of the cone and y G R" be non-zero. 

We next define a sub-optimum decoding function given as follows, 

Ae(xi,X2) =Pe(Xl+X2)\ U Be(x') 

x'eC^\{xi+X2} 

or this can also be expressed as 

A^(xi,x2) =Se^(xi+x2) U Se(x') 

x'eC^\{xi+X2} 

Ag(xi, X2) represents the region of the cone ^^(xi -I-X2), that does not intersect with any other cone corresponding 
to the other lattice codeword points x', located in the thin spherical shell. During decoding, when we receive a 
vector that falls in the region yl6i(xi,X2), we decode to the sum codeword (xi + X2). It may not be possible to 
decode to the individual codewords xi and X2, as different pairs of codewords may yield the same sum. However 
it must be noted, that in the forward phase, we are interested in decoding only to the sum of the transmitted 
codewords. 
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Let Pq denote the probabiUty of error using the sub-optimum decoder. Then, we have 
P;^''"^(xi,X2) = Pr(7r(xi+X2) + Z"GA^(xi,X2)) 

< Pr(7r(xi+X2) + Z" 0Se(xi + X2))+ Pr (^(xi + X2) + G 5e(x')) 

Pr(to + Z" 0i?e(to))+ Pe(xi + X2,x') 

= Pr(to + ^" 50(to)) + V P0(xi+X2,<? + Xl+X2)XT^^(5 + Xl+X2) 

96A„\{0} 

In the above (a) follows because we use the union bound. In (b), the first term follows, because due to symmetry, 
the probability is not dependent on the particular xi + X2. The second term follows as we define p6)(x, x') as 
P6i(x, x') = Prfvrfx) + Bg(-x.')). In (c), we replace x', by g( + xi + X2 and the characteristic function xt^ > 

corresponds to lattice points on the thin shell at radius \/n2P. 
Hence the average probability of error can be bounded as 

M' 

<Pr (to + i?,(to)) + --^+ 

M© 

1 ^ . . < ^ ^^^^ 

1^ 1^ P0(xi + X2,fi' + xi +X2)xtv(5' + xi +X2) 

^ (xi,x2)eC^9eA„\{0} 

The rest of the proof deals with bounding each of the three terms in the above equation by an arbitrarily small 
quantity, to make the probability of error tend to zero as n ^ 00. Below we briefly explain the requirements. 

• The first term can be made very small by choosing the angle 9 appropriately. In effect, we need the noise Z" 
to be contained inside the cone -Be (to) with high probability as the dimension n becomes large. 

• For the second term we need the number of codeword pairs whose sum of codewords lies outside the thin 
spherical shell must be shown to be much lesser than the total number of codeword pairs. In other words 
we need to show that the sum of lattice points are concentrated in the thin spherical shell around the radius 



\fn2P. This is shown in Lemma 12 



• The third term has a summation which is difficult to evaluate and hence we bound it by an integral and evaluate 
the resulting integral. 

• Finally, we require that the number of codewords in each of the inner sphere must be sufficiently large to 
achieve rates close to ^ log(| + SNR). 

The Blichfeldt's principle(see Theorem [9]l can be applied to show concentration of codeword pairs. Lemma 13 



in Appendix C, is an application of the Blichfeldt's principle that guarantees that for any given lattice, we can find 
translations that satisfy < 4^. Also it makes sure that we can find enough codewords in the hyper spheres of 

radius VnP, such that we can achieve a rate of | log{^ + SNR). 

The Minkowski-Hlawka theorem (see Theorem [TT] in Appendix A) is used to establish the existence of at least 
one lattice such that the summation of the third term can be bounded by an integral. This theorem along with 



Lemma 13 in Appendix C, are used together in Lemma 8 to obtain bounds on both the second and third term. 



Hence we can effectively rewrite ([T9]) by using these bounds to get. 



pCe < 4^ + pr (to + Z" Be{to)) 

I'm 



+ 

We can bound the integral, as shown in [18, p. 623-624] to get. 



,in-^W-^ini2P + S))^^^ r.n'^-(.).. 
d„nr(^) Jo ^ ' 



(20) 
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P^^ < Pr (to + ^" Be{to)) + 4^ + 



vr— (/?(2P + (5))"/- siii"--^(6') 



dnnr(^) 



COS( 



(21) 



Now we next need to choose the appropriate values for 9 and dn to make the probabilities go to 0. For the 
second term, consider sinZ(to + Z, to) which is given by 



smZ{to + Z, to) 



I i^n{2P - S) + Z\\)^ + \\Z^^ 
Hence we choose sin ^ = w 2p^^_^_^2 ■ For the third term a good choice of dn is 



2P-S + a^ 



r-^ — . r,. W2(n(2P + 2(5))"/2(sin"e) 
dn = Vn{Vn{2P + 2S)) sin" 9 = r(n/2 + 1) 

This choice helps us to make the third term tend to for large n. The third term then can be rewritten as given 
below. We use the results in [11, p. 277], to bound the Gamma functions to get. 



dnuTi^) 



cos( 



2P + ,5 y/^ r(| + l) 



< 



0rsin6icos6' \2P + 26 ^ 

2 / 2P + <5 [i + r(^^)] 



V7rsin6'cos6' \2P + 2S 



r(^) 



This decays to exponentially as n ^ oo. 

Now, the achievable rate can be obtained from the number of lattice points in the sphere of radius \fnP. This 
value M(A„,Si), M(A„,S2), from the lemma, can be seen to be greater than Hence the rate R is given by. 



P> - logM(A„,s*i) 
n 

1 , ^ 

> — log 

n 8dn 



1 , 
> - log 
n 

>^log 
>^log 



1 



8 \{2P + 2d)sm^9 



n/2 



+ 



2P-S 



2P + 2d \2P + 26 a'^ 



1 P 

2 ^ ^ 



5' 



logs 



n 



Choosing 5' arbitrarily small, a rate of ^ log (5 + ^) can be achieved. 



D. Relationship with ML decoder 

There are some conditions under which the the minimal angle decoder will perform hke the ML decoder. 

(1) It can be easily seen that, for Gaussian noise, the ML decoder is equivalent to minimum distance decoder. 

(2) If all the codewords are concentrated (with high probability) in the thin shell then, we do not lose much by 
neglecting the codewords outside the thin shell. 

(3) Suppose the width of the thin shell is very small and almost all the codewords have approximately the 
same distance from the origin. Then, calculating minimum distance from received vector to the codewords is 
equivalent to calculating the minimum angle. 

(4) Suppose almost all the lattice points on the thin shell are codewords. Then, decoding to any lattice point in 
the thin shell does not sacrifice performance. 

We are deahng with Gaussian noise so the first condition is easily satisfied. In the course of our proof, we will 
observe by applying the Blichfeldt's principle that there exists a concentration of codewords at the thin shell. Hence 
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condition (2) also holds. Moreover, we will also let the width of the thin shell become arbitrarily small, hence what 
we are doing is very close to ML decoding. The 4th condition appears not hold to at low SNR, however. In this 
case, all lattice points in the thin shell may not be codewords. Hence, we think this may lead to the sub-optimality 
at low SNR. The theorem shows that, for SNR < 1/2, we get zero rate. We know that, for random Gaussian 
codebooks, joint decoding of both codewords is possible though. Therefore, we think that the ML decoder may 
give a better performance at low SNR. 

VIII. Joint Decoding Based Scheme 

While the aforementioned scheme is nearly optimal at high SNR, the performance of this scheme at low SNR 
is very poor. In fact, for P/a"^ < 1/2, the scheme does not even provide a non-zero rate. In this regime, we can 
use any coding scheme which is optimal for the multiple access channel and perform joint decoding at the relay 
such that and can be decoded. Then, the relay can encode © and transmit to the nodes. Any coding 
scheme that is optimal for the MAC channel can be used in the MAC phase. A simple scheme is time sharing 
(although it is not the only one) where nodes A and B transmit with powers 2P for a duration of n/2 channel 
uses each but they do not interfere with one another. In this case, a rate of 

1 / 2P\ 

RjD = ^logil + ^j (22) 
can be obtained. It can be seen that this is optimal at asymptotically low SNR, since log(l + snr) w snr for snr ^ 0. 




Fig. 4. Achievable Exchange Rates for the Proposed Schemes 

The performance of these schemes is shown in Fig. |4] where the upper bound and the achievable rates with 
these proposed schemes are shown. The achievable rate with the analog network coding scheme is also shown. It 
can be seen that our schemes outperform analog network coding. However, it must be noted here that the scheme 
proposed here requires perfect synchronization of the phases from the transmissions, whereas the analog network 
coding scheme does not. 

Clearly, we can time share between the lattice based scheme and the joint decoding based scheme in order to 
obtain rates of the form 

Rex = PRex,JD + (1 " P) Rex, Lattice- (23) 

It can be shown that between the SNRs of -0.659 dB and 3.46 dB, time sharing between the two schemes results 
in better rates than the individual schemes. In both the lattice based scheme and the joint decoding based scheme. 
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if the restriction to use n channel uses during the MAC phase and n during the broadcast phase is removed, i.e., 
only the total number of uses is constrained to be 2n, better schemes can be easily designed. Similarly, different 
power sharing may also lead to better schemes. 

IX. Extension to multiple hops 

A. Description 

We can extend the above results to multiple hops. We can again show that rate of ^ log (5 + ^) is achievable 
using structured coding even in the multiple hop scenario. It should be noted that the advantage of this scheme 
over the amplify and forward scheme [3] becomes more pronounced in the multi-hop case, since at each stage for 
the amplify and forward scheme, the channel noise is amplified and hence the amplify and forward scheme will 
suffer a huge rate loss as the number of hops increase. The problem model is shown in Fig. [5] The relay nodes and 
the nodes A and B can transmit only to the two nearest nodes. During a single transmission slot (n uses of the 
channel), a node can either listen or transmit. That is, it can not do both simultaneously. We explain our structured 
coding scheme using a simple example of a 3-relay network. The different transmissions are shown in the table 
given below. 

Here node A and node B have data that need to be exchanged between each other. Each node has a stream 
of packets. Node A has packets named ua,i, ua.2, ■ ■ ■ and node B has packets named 1, 2, .... In the first 
transmission slot the nodes A and B transmit. Nodes A, B transmit vectors xa,i and xb,i, respectively using 
our proposed lattice coding scheme. At the beginning of transmission, the node R2 has no data to transmit in the 
first transmission slot, and hence it remains silent. The node Ri and decode to xi 1 mod A and X2,i mod A, 
respectively. During the second transmission slot the nodes Ri and R3 transmit, while the other nodes remain 
silent. So, in each stage the nodes transmit and the listening nodes decode to a lattice point in the fine lattice in 
the Voronoi region of the coarse lattice. In every second transmission slot a new packet is transmitted to the relay 
nodes by the nodes A and B as can be seen from Table [I] During slots 2,4,6 nodes A, B transmits new packets 
into the relay channel. From this example, we can see that at the 4th slot the node A and B decode x^ 1 and xi 2 
respectively. This is because the node A receives (xi 1 + xi 2 + X2,i) mod A during the 4th transmission and, 
hence, since xi i,xi 2 are already known at the node A, the node A can decode to X2,i using modulo operation. 
The same argument holds for node B. From every two transmissions from this stage a new packet can be decoded 
at each node. This shows that for sufficiently large number packets we can achieve the rate of ^ log(^ + ^). A 
similar encoding scheme can be used for L = 2 nodes also, in the first slot, node A and R2 transmit, while the 
others listen. In the next slot Ri and node B transmit while the others listen and decode. Again the same rate of 
jlogd + ^) is achievable. 

B. Achievable rate 

Theorem 8: For the multi hop scenario defined with L hops, an exchange rate of ^log(| + ^) is achievable 
with nested lattice encoding and decoding. 



Node A 



Relay Node 



Relay Node 



Relay Node 



Relay A/odes ffj /?^ 



NodeB 



Fig. 5. Problem model for multi hop 

Proof: We can easily prove that the theorem holds for a general case L relay nodes in between. In our coding 
scheme in every two slots a new packet is sent out from the nodes A and B. After an initial 2L transmission slot 
delay, in every two slots the relay nodes receives a new packet from the other nodes. Here, we mean that every two 
slots the relay node decodes to a lattice point which is a linear function of a new packet. Hence, at the decoding 
stage at the nodes B and A, we can decode after every two slots since only one variable is unknown, since only 
one new packet (or function of new packet) moves from one node to the other. Hence, we can still achieve the 
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TABLE I 

Table Showing Sequence of Packets in the Multihop Case 



Slot 


Node A 


Node Ri ( mod A) 


Node R2 ( mod A) 


Node R3 ( mod A) 


Node B 


1 


Transmits xi,i 


Xl.l 


Transmits 


X2,l 


Transmits X2,i 


2 


RpTTlfllTlQ ^llpTlt 




Xl.l -^2,1 


± 1 CllliMlll LO 


R PlTlfl 1T1 Q ^1 IPTlf 
AvCllltllllS vJllCllL 


3 


Transmits xi,2 


Xl,2 + Xl,l + X2,l 


Transmits 


Xl,l + X2,l + X2,2 


Transmits X2,2 


4 


Decodes X2,i 


Transmits 


2(X1,1 + X2,l) + 
Xl,2 + X2,2 


Transmits 


Decodes xi,i 


5 


Transmits xi a 


2(xi,i + X2,l) 
+Xi,2 + X2,2 + Xi,3 


Transmits 


2(xi,i + X2,l) 
+Xi,2 + X2,2 +X2,3 


Transmits X2,3 


6 


Decodes X2,2 


Transmits 


4(xi,i + X2,l) + 
2(X1,2 + X2,2) + 
Xl,3 + X2,3 


Transmits 


Decodes xi,2 



rate of ^ log(^ + i^^)- Moreover, the functions in each stage are bounded for a finite L and, hence, we can always 
perform the decoding at the receiver nodes. 

■ 

X. CONCLUSION 

We considered joint physical layer and network layer coding for the bi-directional relay problem where two nodes 
wish to exchange information through AWGN channels. Under the restrictive model of the MAC and broadcast 
phase using n channel uses separately, we showed upper bounds on the exchange capacity and constructive schemes 
based on lattices that is nearly optimal at high SNR. At low SNR joint decoding based schemes (optimal coding 
schemes for the MAC channel) are nearly optimal. These schemes outperform the recently proposed analog network 
coding. Interestingly, our result shows that structured codes such as lattice codes outperform random codes for such 
networking problems. We also showed that minimal angle decoding also leads to similar results. We also showed 
extensions of this scheme to a network with many relay nodes, where the advantages of the proposed scheme over 
simple amplify and forward will be higher than in single relay case. 

Appendix A 

Blichfeldt's Principle and Minkowski-Hlawka Theorem 

Theorem 9 (Blichfeldt's Principle [16]): Let / be a Riemann integrable function with bounded support. If A„ 
is a lattice with fundamental region P„ then 

j^^ f{s)dV{s) = ^ ( 1^ -^^^ + 
Let us define the following function as follows, 

= J J Xr(u)xT(v)dl/(u)dF(v). 

Here = {Vn{VnP))'^ , represents the square of the volume of an n-dimensional sphere of radius VnP. dV 
represents the n-dimensional volume element in rectangular co-ordinates. 

We next estabhsh the following corollary 

Corollary 10: 

Fe= / / Me(An,si,S2)dF(si)dF(s2). 

Js\ J S2 

Proof: Let us define a function 

/(u, v) = XT(u)xr(v). (24) 

For a fixed u, /(u, v) can be seen as a function with bounded support and also can be seen to be integrable. Hence 
we can apply the Blichfeldt's principle to get 

h{n) = J f{u,v)dV{v) = (^Y. /(u,/i2 + S2)j dF(S2). 
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Now h{u) can again be seen as a Riemann integrable function with bounded support, and hence the Blichfeldt's 
principle could be applied again to get the following. 



y® = j h{u)dV{u) 



(a) 



jp /(/il+Sl,/l2+S2)j (iy(s2)j (iy(si) 



p„ JP, 



Me(A„, si, S2)(iy(si)(iy(s2) 



Above (a) follows since we have a finite number of non-zero terms, and hence the integral and the summation 
can be interchanged. Also in (b), M0(A„, 81,82) is the number of pairs of lattice points for the translations 81 and 
82. 

■ 

In short the Corollary 8 relates the square of the volume of an n-dimensional sphere and number of pairs of 
lattice points for different translations. 

Theorem 11 (Minkowski-Hlawka): Let / be a nonnegative Riemann integrable function with bounded support. 
Then for every d G 1R+ and n > 2, there exists a lattice A„ with determinant det(A„) = d such that 

d Y < [ fdV{^) 

9eAn\{0} 

The Minkowski-Hlawka theorem gives us a way to connect a series of discrete sums with a continuous integral. 
This will find appUcations in our probability of error calculations. 



Appendix B 
Hyper Volume Concentration Lemma 

Lemma 12: Let V%, be defined as 



K = 1 1 Xt(u)xt(v)xt^(u + ^r)dV{n)dV{^r) 



then we can choose n sufficiently large such that, ^ < 6, for every given positive 6. 
Proof: 

First we perform a change of variables in the integral, by substituting x = u + v. This gives, 

V^ = J J XTiu)xTi^-u)xT'^{^)dViu)dV{^) 
Let us consider first the inner integral, for a fixed x,given by. 



j Xt(u)xt(x - u)dy(u). 



This geometrically represents the hyper volume of intersection of two hyper-spheres, whose centers are at a distance 
||x||, from each other. This is pictorially shown in Fig. |6] The calculation of hyper volume of intersection, reduces 
to obtaining the hyper volume of the conical section and a cone. This is shown pictorially in the second diagram 
in Fig. [6] Here opq represents the hyper cone and oprq represents the conical section. Here we denote by Vcsdxl) 
to represent the volume of the conical section and Vcol |x|) as the volume of the cone. The integral can hence be 
be evaluated as 

' Xr(v)xT(x - ^r)d^r = 2{VU\^\) - VM)). 
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Fig. 6. Geometrical interpretation of the integral 



To simplify calculations we can bound the integral as, 

/ Xt(v)xt(x - ^r)dV{^r) < 2T4,(|x|). 
Jt 

From the figure we can see oe = ||x||/2. Hence the half-angle 9 can be calculated as cos 9 
Hence with the defined 9, we can calculate the hyper volume as, 



/nP 2VnP 



sin" ^^/^d-i/" 



Hence, 



Yk 



< 2 



?/)=0 



11x11 



n — L 1 

TT^^ (nP) - 



-XT' (x)dy(x) 



(25) 



(26) 



But ^0 is given by V'^, where V is the hyper volume of an n-dimensional hyper sphere of radius \/nP denoted 
by K(V^). 

The value of Vcs( ||x||) depends only on the distance of x from the origin. To make evaluation of the integral 
easier, we change the volume element to circular co-ordinates and integrate. Thus the integral now becomes. 



< 2 



nvr 2 



Vcsjr) 



(27) 



where is defined as the union of closed intervals, (will be given later). Now substituting Vcs{r) from above gives. 



< 4 



Ti — ± n 

71^2^ (nP) 2 



Fffi - 'r(f + 1) nrc^) Jr 



dipdr 



Now let us choose cos 9 
V' 



— '1-1 , 

nvr 2 vr 2 (jiP) 



-^r(§ + i) nr(^) 



. Then change of variables gives 

(nP)"/2 cos"- 10 sin 



fee' 



VnWnPY 

2"sin"-2v^ 
^=0 Vn{V^? 



(28) 



(29) 
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Substituting for Vn{VnP), we get, 

^® . 4 r(| + i) 



- ^ r(^) y^eei ^=0 

/ 2"cos"-^6'sin(9 / sin"" 



(30) 



< / 2"cos"-^^sin^ / sm^-^ i)dil)de 

(31) 



+— / 2"cos"-^^sin^ / sm^-'^ i)dil)de 

Now we will use the bound by Shannon for the first term. We can apply the bound for ^ < 7r/2. Hence we split 
the integral into two terms to apply the bound. For the second term we will bound sin^' by 1. This gives, 

I ./flcfl' nrn 

' 3 



y® r(V) j0e(9^n[o,f+„] (ra-l)cos^ 

Simphfying things further we get, 

. 4 r(| + l) /• 



(32) 



< ^ / 2"-^sin'*-^0cos"-'0tan^d0 



4 (n-l)r(| + l) /• ,n-l_n-l.....^ 

r(^) 7,,,^n 



(33) 



^ / 2"-icos"-iesin^-d^ 



Next we bound tan^ in the first term by tan(7r/3 + rf) and sin^ in the second term by 1. We also bound the 
factorial using the bound given in Urbanke. This yields. 

Again we can see that since does not take the value 7r/4 we can bound the first term appropriately. In the 
second term the maximum value of cos G can be used to bounded it appropriately. This is given below as follows. 

^ 4 f(l + r(-^))tan(f + r;) vr _ .^tt 
Fe-v^ r(r^) ^""^2 '^^^ 2 

+ A(^^^Mi±rm)-pcos(-+,)]-- 
^ r(^^) 2^ ^3 2 

From the above we can easily see, that both terms tend to as n — >^ oo. From this the lemma follows. 



20 



Appendix C 

Application of Blichfeldt's Principle to Show Existence of Good Translations 

Lemma 13: Let A„ be a lattice having a fundamental region P„ and let det(A„) be it's fundamental volume and 
define 



Then 



P„* = {(si,s.)eP.xP„:M,(A„,s.)>|^; 

^^(^"'^^^ ^ 8de^(Aj' Me(A„,s„s.) ^ ^l^J 
Vq^<2 [ Me(An,si,S2)dl/(si,S2), 



where > and can be made arbitrarily small for sufficiently large n 
Proof: Let us define the following sets, 

^.= {s.eP„:M,(A„,s.)>^^ 

8det(An) 



Gn = < S2 G Pn : M2(A„,S2) > 



I M©(A„,si,S2) V© 

Therefore P* = (P„ x G„) P) 0„. Define the complements here. Hence 

PnXPn = P* \J{{Fn^ G„) H O^} (J {(P„ X G^f} (36) 

Hence we have 

Fe= / Me(A„,si,S2)cZy(si)dy(s2) (37) 
ye</ Me(A„,si,S2)dF(si)dy(s2)+ / Me(A„,si,S2)dF(si)dF(s2) 

+ / Me(A„,si,S2)dy(si)dy(s2) 

We can replace the second integral, using the condition that the translations are not in the set 0„, to get a bound 
on M0 as shown below, 

Fe</ Me(A„,si,S2)dF(si)dF(s2) + ;^ / M^(A„,si,S2)dF(si)dF(s2) 

+ / (EE /(/ll + Sl,/i2 + S2) I dF(si)dF(S2) 

Since, C P„ x P„, we can bound the second integral again as shown below. 

< / Me(A„, si, S2)dy(si)dF(s2) + / M^(A„, si, S2)dF(si)dy(s2) 

+ / EE XT(/il+Sl)xT(/i2 + S2) I dF(Sl)dF(S2) 

By using Blichfeldt's principle and following similar steps as in Corollary 4, we can show 
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/P„XP„ M^(A„,Si,S2)dF(Si)dy(S2) = V^. 

<j^^ M^(A„,si,S2)dF(si)dy(s2) + 



4 



+ / I E E ^^(^1 + Sl)Xr(^2 + S2) I dF(si)dF(S2) 



Next we change the double summation into a product of two summations, to get 

4 



y®<j^ Af^(An,Sl,S2)(iF(si)dy(s2) + 



+ / E Xt(^i + si) E Xr(/i2 + S2) Unsi)(ins2) 

The summation X^/^^g^ Xt(^i + si) can be seen to count the number of lattice points in the sphere r„, for 
the translation si. This is by definition equivalent to M(A„,si). Similarly we can replace the other summation by 
M(A„,S2), to get 

M^(A„,si,S2)dF(si)dy(s2) + ^+ [ M(A„,si)M(A„,S2)dF(si)dF(s2) 



Since (F„ x C {Fn x P„) U (P„ x G„) we can bound the integral to get 



y(B<j^ M^{An,Si,S2)dV{si)dV{S2) + 



M(A„, si)M(A„, S2)dy(si)dF(s2) 



+ / M{An,Si)M{An,S2)dV{si)dV{S2) 

We next use Blichfeldt's principle to simphfy the integrals. 

l/e < / M^{An, Si, S2)dF(si)dy(S2) + ^ + Vn{V^) [ M(A„, si)dF(si) 

+ K(VnP) [ M{An,S2)dV{s2) 
We use the condition si ^ F„ and S2 Gn, to get 

Fe < Me(A„,si,S2)dy(si)cZF(s2) + ^ + M^^^ ^sd^) Jpc "^^^^'^ 



8det(A„) 

Since C P„ and /p (iy(si),/p dV{s2) = det(A„), we obtain, 

< Me(A„,si,s2)dy(si)dF(s2) + ^ + K(v^) '^"^^^ + y„(v^) ^"^^^ 
Finally using = (V^(\AiP))^, we obtain 



Fe<2^ Me(A„,si,S2)(iy(si)(iF(s2) 



From the above lemma it can be seen that the measure of P* must be non-zero and hence, there must be at least 
some translations (si,S2) of the lattice, where the requirements hold. 
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Appendix D 

Minkowski- Hlawka Theorem to Show Good Lattices Exist 
Lemma 14: There exists translational vectors s^, 83 such that the following holds 
1 



Me(A„,s*i,s^) 



XI m ^'e(Xl + X2,C/ + Xi +X2)XrA (5f + Xi + X2) 



< 2 



(n- l)7rV(n(2P + 5))"/2 



^0 



sin" '^{x)dx 



and 



8det(A„) 8det(A„) Me(A„,s^,s^) 

Proof: 

First let us define the function /(z) as follows 



/(z) 



:/ / p0(u + v,z + u + v)xT^^(fi' + u + v)xr(u)xT(v)xT^^(u + v)(iF(u,v) 



Clearly /(z) is a nonnegative integrable function and has bounded support. Hence we can apply the Minkowski- 
Hlawka theorem and find a lattice A* , such that 



9eA„\{0} 



Consider the integral 



M©(A„, 81,82) 



M©(A„, 81,82) 



XI X P0(xi+X2,5 + Xi + X2) 



(xi,x2)eC^fleA„\{o} 



Xt^^{9 + xi + X2) dF(8i, 82) 

/= / V pe(xi + X2,5 + Xi +X2)Xt^(5 + Xi +X2)(iy(8i,82) 



(xi,x2)ec^ 9eA„\{o} 

< / XX Pe(xi + X2,5 + Xi +X2)Xt^(5 + Xi +X2)dF(8i,82) 

•^^"^■P"(xi,x2)eC^9eA„\{o} 
The above follows as P* C P„ x P„. 

/< / X X +Sl + /l2 + S2,5 + ^1 +Sl + /l2 + S2) 

•''fnXP„ (hi,h2)6A„xA„ geA„\{0} 

XT^^id + hi+Si+h2 + S2)XT{hl + 8i)xT(/i2 + S2)XT^^(^1 + Si + ^2 + 82)dF(8i, 82) 

In the above equation we substituted xi = /ti + 81 and X2 = ^2 + S2. Next applying the Blichfeldt's principle 
twice we obtain, 

I< / p6i(u + v,5r + u + v)xr^^(5 + u + v)xr(u)xT(v)xr^^(u + v)(iy(u,v) 

Here u = /ii + si and v = /12 + 82. We can next take the summation outside the double integral, as we are 
dealing with a finite number of non-zero sums. This gives, 

/< X / / Pe(u + v,5 + u + v)xrv(ff + u + v)xr(u)xr(v)xrv(u + v)(iy(u,v) 
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Next we use the Minkowski-Hlawka theorem to estabhsh that there exists at least one lattice A* such that the 

summation can be bounded by an integral, as shown below. In short the Minkowski-Hlawka theorem gives an 
existence result that such a lattice can be found, but does not give a way to find such a lattice. 

I<^l I I pe(u + v,w)xrA (w)xT(u)xT(v)xr^(u + v)dy(u,v)(iy(w) 

dn J JR" jRn ^ ^ 

In the above we have replaced w + u + v with w. Again we change the order of integration to get, 



- :r / / / (" + V, w) XT^(w)dy(w) 

«n Jr" Jr" V ^ 



Xt(u)xt(v)xt^_(u + v)dy(u, v) 



The inner integral can be seen to be independent of u + v, and the outer integral can be seen to be lesser than 
V@. Hence we can express the resultant integral as. 



1 < 



V f 

-r / P0(so,w)xT^(w)dy(w). 

dn J ^ 



dr, 



Hence we next evaluate the integral using similar steps as in [11], first by changing to spherical coordinates. 

y r^n{2P+S) 
/ / 

dn J^n{2P-5) 

where d^(w) is the (n — 1) -dimensional volume element of a thin spherical element at a distance r from the origin. 
Next we use the definition of pe to get. 



Pe (so, w) xr^^(w)(iA(w) 



' w: w =r 



/ < 



w: w =r 



dfi J yJn{2P-5) 

We evaluate the probability by conditioning on Z 

y i-^ni2P+S) 



Pr (7r(so) G 50 (w)) Xt^ (w)cZ^(w) 



dr. 



I < 



/ / / Pr(7r(so) + G Be{w)\Z'' = z)Pt{Z'' = z)dV{z) 

in Jy/n{2P-5) 



XTA_(w)d74(w) 

Since the function is non-negative, the order of integration can be interchanged to obtain 



dr. 



I < 



-^n(2P-<5) iw;||w|| 



[Pr(7r(so) + Z"G50(w)|Z- = z)] 
Xt^ {w)dA{w)dr] Pr(Z" = z)dF(z). 



Due to the conditioning on Z", the conditional probability becomes deterministic and is equivalent to the cross 
sectional area of a hyper cone of half-angle 9, intersecting with a hyper-sphere of radius r. Note that in the result 
in [11], the area of cross section must be a function of r and not of VnS'. This gives. 



I < 



dn 

This in turn yields 

/ < 



Pr(Z'" = z)dV{z) 



r^/n{2P+S) 
Vn(2P-5) 



(n — l)7r''2 ^ 



^0 



sin" ^{x)dx 



dr. 



(n - l)7r"^ ((n(2P + Syf /-' - ( n(2P - ^))"/^) . ' 

j sm iXjU/X 
Jo 



nr(^) 



/ < 



dn 



nr(a±l) 



/ 

Jo 
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Next using Lemma [13] we can bound the resultant integral to get, 



dF(si,S2). 



Now comparing the integrals and also knowing that the measure of P* is non-zero we can see that there must be 
at least one such translational vector pair (si, S2) such that the required assertion of the lemma holds. ■ 
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